SYNTHESIS OF HUMAN PROCOLLAGENS AND COLLAGENS
IN RECOMBINANT DNA SYSTEMS

GOVERNMENT RIGHTS 

This invention was made in the course of research
supported in part by NIH grants AR38188 and AR39740. The
Government may have certain rights in this invention.

BACKGROUND OF THE INVENTION 

Expression of many exogenous genes is readily obtained
in a variety of recombinant host-vector systems, but becomes
difficult to obtain if the protein normally requires extensive
post-translational processing. This is the likely reason that
expression in a fully recombinant system has not been reported
for any of the major fibrillar collagens that require
processing by post-translational enzymes. See Prockop and
Kivirikko, N. Engl. J. Med. 1984, 311, 376-386. Prolyl
4-hydroxylase is probably one of the most important
post-translational enzymes necessary for synthesis of
procollagen or collagen by cells because it is required to
hydroxylate prolyl residues in the Y-position of the repeating
-Gly-X-Y- sequences to 4-hydroxyproline. Prockop and Kivirikko,
N. Engl. J. Med. 1984, 311, 376-386. Unless an appropriate
number of Y-position prolyl residues are hydroxylated to
4-hydroxyproline by prolyl 4-hydroxylase, the newly synthesized
chains cannot fold into a triple-helical conformation at 37°C.
If the hydroxylation does not occur, the polypeptides remain
non-helical, are poorly secreted by cells, and cannot
self-assemble into collagen fibrils. Recently, prolyl
4-hydroxylase was expressed in a baculovirus system. Vuorio, K.
et al., Proceedings of the National Academy of Sciences, U.S.A.,
1992, 89, 7467-7470.

Schnieke et al., Proc. Natl. Acad. Sci. U.S.A. 1987,
84, 8869-8873 and Lee et al., J. Biol. Chem. 1989, 264,
20683-20687, disclose rescue experiments in two different systems
that synthesized only one of the two chains for type I
procollagen. Schnieke et al. reported that a gene for the
human fibrillar collagen proα1(I) chain, the COL1A1 gene, can
be expressed in mouse fibroblasts and that the chains are used
to assemble molecules of type I procollagen, the precursor of
type I collagen. However, in this system the proα2(I) chains
found in the same molecule are of mouse origin. In the system
of Lee et al. the proα1(I) chains are of rat origin. Thus,
synthesis of a procollagen molecule in which all three chains
are derived from an exogenous gene was not obtained by either
Schnieke et al. or Lee et al.

Failure to obtain expression of genes for fibrillar
collagens in a fully recombinant system has hampered attempts
to study the normal structure-function relationships of the
proteins and to study the effects of mutations. In particular,
mutations in the gene for type II procollagen have recently
been implicated as the cause of several human diseases.
Anderson et al., Am. J. Hum. Genet. 1990, 46, 896-901; Tiller
et al., Proc. Natl. Acad. Sci. U.S.A. 1990, 87, 3889-3893;
Vissing et al., J. Biol. Chem. 1990, 264, 18265-18267; Lee et
al., Science 1989, 244, 978-980; Francomano et al., Genomics
1987, 1, 293-296; Knowlton et al., Am. J. Hum. Genet. 1989, 45,
681-688; Ahmad et al., Am. J. Hum. Genet. 1990, 47, A206;
Palotie et al., The Lancet 1989, I, 924-927; Knowlton et al.,
N. Engl. J. Med. 1990, 322, 526-530; Ala-Kokko et al., Proc.
Natl. Acad. Sci. U.S.A. 1990, 87, 6565-6568. However, because
adequate numbers of human cartilage cells are difficult to
obtain and because human chondrocytes readily lose their
phenotype in culture, Elima and Vuorio, FEBS Lett. 1989, 258,
195-198; Aulthouse et al., In Vitro Dev. Biol. 1989, 25,
659-668, the causal relationship between a mutation in the gene
and the biological function of the protein has proven elusive.

Also, failure to obtain expression of genes for human
fibrillar collagens has made it impossible to prepare human
fibrillar procollagens and collagens that have a number of
therapeutic uses in man and that will not produce the
undesirable immune responses that have been encountered with
use of collagen from animal sources.

Recently, however, Applicants described the expression
of a human type II procollagen in mouse 3T3 cells using a
promoter from the human type I procollagen gene. Ala-Kokko et
al., J. Biol. Chem. 1991, 266, 14175; Ala-Kokko et al., Matrix
1990, 10, 234.

SUMMARY OF THE INVENTION 

The present invention involves the preparation of gene
constructs that contain collagen genes of human and other
origins. One of the gene constructs is a hybrid of a human gene
for type I procollagen (COL1A1) and a human gene for type II
procollagen (COL2A1). The 5'-end of the construct contains the
promoter, exon 1 and intron 1 of the COL1A1 gene fused to
intron 1 of the COL2A1 gene. The construct is designed so that
the promoter and putative enhancer in the first intron of the
COL1A1 gene drive expression of the COL2A1 gene and cause
production of human type II procollagen. The COL2A1 gene
consisted of two SphI/SphI fragments of the gene totalling about
26,000 base pairs. This construct contains all the coding
sequences of the gene except for the few codons of a signal
peptide in exon 1 and an alternatively spliced exon that follows
exon 1. Some versions of the construct also include a 3,500 base
pair SphI/SphI fragment from the 3'-end of the gene that is
needed for correct polyadenylation of the mRNA.

A second construct has the promoter, the first exon,
the intron, and about half of the second exon of the human
COL1A1 gene as the 5'-fragment of the construct. The
5'-fragment is joined through a unique KpnI restriction
endonuclease site to a cDNA that contains all the coding
sequences of the gene except for those contained in the first
one and one-half exons. In addition, the 3'-end of the cDNA is
linked through an EcoRI site to an EcoRI/EcoRI fragment of
about 0.5 kb from the 3'-end of the COL1A1 gene. A series of
additional constructs use the highly active promoter for the
cytomegalic virus to drive expression of full-length cDNA for
the human COL1A1 gene. All the constructs have been engineered
so that they have unique restriction endonuclease sites at
their 5'- and 3'-ends and, therefore, can be excised from
vector sequences.
The present invention involves transfection of collagen
gene constructs into selected cells and expression of the
constructs in those cells. In some preferred embodiments of the
present invention, selected cells express one or more
post-translational enzymes important to the biosynthesis of
procollagens and collagens. For example, prolyl 4-hydroxylase is
a post-translational enzyme important to the biosynthesis of
procollagens and collagens. The enzyme must hydroxylate about
100 prolyl residues in the Y position of the repeating -Gly-X-Y-
tripeptide structures of procollagens and collagens to
4-hydroxyproline in order for the procollagens or collagens to
fold into a stable triple-helical conformation at the body
temperature of the organism synthesizing the protein. Thus, in
some preferred embodiments of the present invention, cells which
express prolyl 4-hydroxylase are preferred. Such cells may
naturally express the post-translational enzymes, or may be
transformed with genes coding for post-translational enzymes
such as prolyl 4-hydroxylase. Mammalian cells, insect cells, or
yeast cells are preferred. Mammalian cells, insect cells and
yeast cells which are transfected with at least one set of genes
coding for a post-translational enzyme such as prolyl
4-hydroxylase may also be transfected with collagen gene
constructs in yet other preferred embodiments of the present
invention. The invention can also employ other cells that can be
cultured and contain the necessary post-translational enzymes
and secretory mechanisms, such as Chinese hamster ovary cells.
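
The hydroxylation requirement described above can be illustrated
with a short sketch. The Python function below counts the prolyl
residues that occupy the Y position of consecutive -Gly-X-Y-
triplets, that is, the residues that are candidates for
hydroxylation by prolyl 4-hydroxylase. The function name and the
12-residue test fragment are hypothetical and are used only for
illustration; the fragment is not a sequence disclosed in this
specification.

```python
def y_position_prolines(helical_domain: str) -> int:
    """Count prolines in the Y position of consecutive Gly-X-Y triplets.

    The input is a one-letter amino acid string that is assumed to
    start at the Gly of the first triplet.
    """
    seq = helical_domain.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence must be a whole number of triplets")
    count = 0
    for i in range(0, len(seq), 3):
        gly, y = seq[i], seq[i + 2]
        if gly != "G":
            raise ValueError(f"triplet at residue {i + 1} lacks Gly")
        if y == "P":
            count += 1
    return count


# Hypothetical four-triplet fragment: GPP, GAP, GEK, GPA.
toy_fragment = "GPPGAPGEKGPA"
print(y_position_prolines(toy_fragment))
```

Only the prolines in the third position of a triplet are counted;
the proline in the X position of the first and last triplets of
the toy fragment is ignored.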



WO 93/07889 



PCT/US92/09061 




BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a photograph showing analysis by
polyacrylamide gel electrophoresis in SDS of the proteins
secreted into medium by HT-1080 cells that were transfected
with a gene construct containing the promoter, first exon and
most of the first intron of the human COL1A1 gene linked to a 30
kb fragment containing all of COL2A1 except the first two
exons. The cells were incubated with [14C]proline so that the
medium proteins could be analyzed by autoradiography (storage
phosphor film analyzer). Lane 1 shows that the unpurified
medium proteins are comprised of three major polypeptide
chains. The upper two are proα1(IV) and proα2(IV) chains of
type IV collagen that are synthesized by cells not transfected
by the construct (not shown). The third band is the proα1(II)
chains of human type II procollagen synthesized from the
construct. Lanes 2 and 3 are the same medium protein after
chromatography of the medium on an ion exchange column (DE-52,
Whatman, at pH 7.4 in lane 2 and at pH 7.0 in lane 3). The
type II procollagen appeared in the void volume of the ion
exchange column.

Figure 2 is a photograph showing that the type II
procollagen secreted into the medium from cells described in
Figure 1 was folded into a correct native conformation. The
medium proteins were digested at the temperatures indicated
with a high concentration of trypsin and chymotrypsin under
conditions in which correctly folded triple-helical procollagen
or collagen resists digestion but unfolded or incorrectly
folded procollagen or collagen is digested to small fragments
(Bruckner and Prockop, Anal. Biochemistry 1981, 110, 360). The
products of the digestion were then analyzed by polyacrylamide
gel electrophoresis in SDS and fluorography. The results show
that the type II procollagen resisted digestion up to 43°C, the
normal temperature at which type II procollagen unfolds.
Therefore, the type II procollagen is correctly folded and can
be used to generate collagen fibrils.

Figure 3 is a photograph showing analysis of medium
of HT-1080 cells co-transfected with a gene for COL1A1 and a
gene for COL1A2. The COL1A2 was linked to an active
neomycin-resistance gene but the COL1A1 was not. The cells were
screened for expression of the COL1A2-neomycin resistance gene
construct with the neomycin analog G418. The medium was
analyzed for expression of the COL1A1 by Western blotting with
a polyclonal antibody specific for the human proα1(I) chain.
Lane 1 indicates that the medium proteins contained proα1(I)
chains. Lane 2 is an authentic standard of type I procollagen
containing proα1(I) chains and partially processed pCα1(I)
chains. The results demonstrate that the cells synthesized
human type I procollagen that contained proα1(I) chains,
presumably in the form of the normal heterotrimer with the
composition two proα1(I) chains and one proα2(I) chain.

Figure 4 is a schematic representation of the cDNA for
the proα1(I) chain of human type I procollagen that has been
modified to contain artificial sites for cleavage by specific
restriction endonucleases.

Figure 5 is a photograph showing analysis by
non-denaturing 7.5% polyacrylamide gel electrophoresis (lanes
1-3) and 10% polyacrylamide gel electrophoresis in SDS (lanes
4-6) of purified chick prolyl 4-hydroxylase (lanes 1 and 4) and
the proteins secreted into medium by Sf9 cells expressing the
gene for the α-subunit and the β-subunit of human prolyl
4-hydroxylase and infected with α58/β virus (lanes 2 and 5) or
with α59/β virus (lanes 3 and 6). α58/β and α59/β differ by a
stretch of 64 base pairs. Lanes 1-3 are protein separated
under non-denaturing conditions and showing tetramers of the
two kinds of subunits. Lanes 4-6 are the same samples
separated under denaturing conditions so that the two subunits
appear as separate bands.

DETAILED DESCRIPTION OF THE INVENTION 

It has been established that most forms of
osteogenesis imperfecta (OI) are caused by dominant mutations
in one of the two genes for type I procollagen. Also, at least
a subset of post-menopausal osteoporosis is caused by similar
mutations in the two genes for type I procollagen. It has
further been reported that mutations in the type II procollagen
gene cause human diseases such as chondrodysplasia and a
subset of primary generalized osteoarthritis. It has further
been reported that mutations in the type III procollagen gene
(COL3A1) cause human diseases such as a lethal variant of
Ehlers-Danlos syndrome (type IV) and familial aneurysms.
Moreover, it has been demonstrated that the kidney disease
known as the Alport syndrome is caused by mutations in one of
the genes (COL4A5) for type IV collagen. It has further been
demonstrated that injections of suspensions of collagen fibers
are effective for the treatment of cosmetic defects as well as
physical weakness of tissues such as sphincters.

The present invention concerns cells in which one
of these fibrillar procollagens is expressed both as mRNA and
as a protein. Additionally, the present invention concerns
types I, II, and III procollagens expressed in a mammalian cell
line, an insect cell line, or a yeast cell line, and the
establishment of transfected cell lines comprising these
procollagen genes.

The present invention further provides that the gene
constructs can be used to synthesize human fibrillar
procollagens in the HT-1080 human tumor cell line. This human
cell line has been a ready source of type IV collagen, the
major collagen of basement membranes. Because type IV collagen
is not a fibril-forming procollagen or collagen, it can be
readily separated by a simple chromatographic procedure from
any fibrillar procollagen. Hence, the invention provides
methods whereby a human fibrillar procollagen can be readily
separated from products of an endogenous collagen gene.
Moreover, HT-1080 cells grow extremely rapidly in culture and
can be maintained for long periods of time.

Additionally, the present invention provides for a
single procollagen or collagen gene or a number of different
procollagen or collagen genes expressed within a cell.
Further, it is contemplated that there can be one or more
copies of a single procollagen or collagen gene or of the
number of different such genes transfected into cells and
expressed. The present invention provides that these cells can
be transfected so that they express at least one human
procollagen gene, especially but not limited to the COL1A1 gene
encoding the proα1(I) procollagen chain of human type I
procollagen. It is also provided that the cells can be
transfected with and express both COL1A1 and COL1A2 genes so
that both proα2(I) and proα1(I) chains are simultaneously
synthesized and assembled into normal heterotrimeric molecules
of type I procollagen. Moreover, the present invention
provides that cells can be transfected with and express the
COL2A1 gene encoding the proα1(II) chain of human type II
procollagen. It is further provided that cells can be
transfected with and express the COL3A1 gene encoding the
proα1(III) chain of type III procollagen. The invention also
provides that any procollagen or collagen gene transfected into
and expressed within cells may comprise a mutant, variant,
hybrid or recombinant gene. Such mutant, variant, hybrid or
recombinant gene may include a mutation which provides unique
restriction sites for cleavage of the hybrid gene. In some
preferred embodiments of the present invention, mutations
providing one or more unique restriction sites do not alter the
amino acid sequence encoded by the gene, but merely provide
unique restriction sites useful for manipulation of the gene.
Thus, the modified gene would be made up of a number of
discrete regions, or D-regions, flanked by unique restriction
sites. These discrete regions of the gene are herein referred
to as cassettes. For example, cassettes designated as D1
through D4.4 are shown in Figure 4. A gene containing multiple
copies of a gene cassette is another variant of the present gene
which is encompassed by the present invention. Recombinant or
mutant genes or cassettes which provide desired characteristics
such as resistance to endogenous enzymes such as collagenase are
also encompassed by the present invention. Further, the present
invention provides transfected cells substantially all of which
comprise other procollagen or collagen genes, preferably but
not limited to types I, II, and III procollagen genes or type IV
collagen genes. The present invention contemplates that
transfected cells may be mammalian cells such as human tumor
cells, especially but not limited to HT-1080 cells. In other
embodiments of the present invention, transfected cells are
insect cells such as baculovirus Sf9 cells. In still other
embodiments of the present invention, transfected cells are
yeast cells, such as Saccharomyces cerevisiae or Pichia
pastoris cells. In preferred embodiments of the present
invention, cells such as mammalian, insect and yeast cells,
which may not naturally produce sufficient amounts of
post-translational enzymes, are transformed with at least one
set of genes coding for a post-translational enzyme such as
prolyl 4-hydroxylase.

The present invention further contemplates cells
substantially all of which comprise at least one transfected
human procollagen or collagen gene and express procollagen or
collagen molecules having at least one chain derived from a
transfected procollagen or collagen gene or genes and at least
one chain derived from an endogenous human or non-human
procollagen gene or genes, other than the [proα1(I)]₂proα2(I)
collagen molecule consisting of human proα1(I) moieties and
non-human proα2(I) moieties, or non-human proα1(I) moieties and
human proα2(I) moieties.

A novel feature of the methods of the invention is
that relatively large amounts of a human fibrillar procollagen
can be synthesized in a recombinant cell culture system that
does not make any other fibrillar procollagen. Systems that
make other fibrillar procollagens or collagens are impractical
because of the extreme difficulty of purifying the product of
the endogenous genes for fibrillar procollagen or collagen from
products of the recombinant genes. Using methods of the
present invention, purification of human procollagen is greatly
facilitated. Moreover, it has been demonstrated that the
amounts of protein synthesized by the methods of the present
invention are high relative to other systems used in the art.

Other novel features of the methods of the present
invention are that procollagens synthesized are correctly
folded proteins so that they exhibit the normal triple-helical
conformation characteristic of procollagens and collagens.
Therefore, the procollagens can be used to generate stable
collagen fibrils and fibers by cleavage of the procollagens
with proteases.

The present invention is in contrast to Schnieke et
al., who reported that a gene for the human fibrillar
procollagen proα1(I) chain, the COL1A1 gene, can be expressed
in mouse fibroblasts and the chains used to assemble molecules
of type I procollagen, the precursor of type I collagen.
However, in the system of Schnieke et al., the proα2(I) chains
found in the molecule of type I procollagen were of mouse
origin. Hence, the type I procollagen synthesized is a hybrid
molecule of human and mouse origin. Similarly, the system of
Lee et al. expressed an exogenous proα2(I) gene to generate
type I procollagen in which the proα1(I) chains were of rat
origin. The present invention provides methods for the
production of procollagens or collagens derived solely from
transfected procollagen and collagen genes, but these methods
are not limited to the production of procollagen and collagen
derived solely from transfected genes.

An advantage of human collagens of the present
invention is that these collagens will not produce allergic
responses in man. Moreover, collagen of the present invention
prepared from cultured cells should be of a higher quality than
collagen obtained from animal sources, and should form larger
and more tightly packed fibers. These higher quality proteins
should form deposits in tissues that last much longer than the
currently available commercial materials. It is known that
using currently available methods, most injections of collagen
for cosmetic purposes have to be repeated as frequently as
every 6 months. Human protein of the present invention should
last much longer after injection into human tissues.

Methods of the present invention provide a practical
source of a human fibrillar collagen similar to animal
collagens that are widely used for injection to remove cosmetic
wrinkles and other cosmetic defects, and are also
being used to restore the tensile strength of tissues such as
the sphincter of the bladder in the treatment of urinary
incontinence. Animal collagens are also used in mixtures with
ceramics and other materials to fill in defects in bone and
enhance bone growth. Type I collagen from animal sources has
been used commercially. However, a convenient source of human
collagen for therapeutic use is still sorely needed.

Further, the present invention contemplates that human
type II procollagen, the precursor of the major collagen of
cartilage, may have special use in the repair of cartilage
damage. Moreover, modified human type I procollagen comprising
a proα1(I) trimer expressed according to the methods in the
present invention is also contemplated. Also, type I
procollagen comprised of two proα1(I) and one proα2(I) chains
derived from transfected human genes is contemplated. Also,
type III procollagen comprised of three proα1(III) chains
derived from transfected human genes is contemplated. In
addition, specifically engineered forms of these collagens are
contemplated.

Methods are provided for synthesizing fibrillar
collagen in cells comprising transfecting at least one human
procollagen or collagen gene into cells and selecting
transfected cells that comprise molecules derived from a
procollagen or collagen gene or genes, other than the
[proα1(I)]₂proα2(I) molecule consisting of human proα1(I)
moieties and non-human proα2(I) moieties, or non-human α1(I)
moieties and human α2(I) moieties. Further, methods whereby at
least one of the human procollagen genes is a mutant, variant,
hybrid or recombinant gene are also contemplated.
Additionally, the present invention provides methods whereby
substantially all cells transfected with at least one
procollagen gene comprise type III and other procollagen genes.
Further, methods are contemplated wherein transfected cells are
human tumor cells, especially but not limited to HT-1080 cells.
Methods are also provided whereby transfected cells comprise
independently substantially no endogenously derived collagen
molecules, endogenously derived type I procollagen molecules,
endogenously derived type II procollagen molecules,
endogenously derived type III procollagen molecules, or
endogenously derived type IV collagen molecules. Other methods
are provided whereby substantially all of the transfected cells
comprise at least one transfected human procollagen gene and
express procollagen or collagen molecules having at least one
chain derived from the transfected gene, other than the
[proα1(I)]₂proα2(I) collagen consisting of human proα1(I)
moieties and non-human proα2(I) moieties, or non-human proα1(I)
moieties and human proα2(I) moieties. Other preferred methods
are provided whereby substantially all transfected cells
comprise at least one transfected human procollagen gene and
express procollagen molecules having three chains derived from
the transfected collagen gene or genes.

The present invention is further illustrated by the
following examples, which are not intended to be limiting in
any way.

EXAMPLES 

Example 1 Synthesis of Human Type II Procollagen 

A recombinant COL1A1 gene construct employed in the
present invention comprised a fragment of the 5'-end of COL1A1
having a promoter, exon 1 and intron 1 fused to exons 3 through
54 of a COL2A1 gene. The hybrid construct was transfected into
HT-1080 cells. These cells were co-transfected with a
neomycin-resistance gene and grown in the presence of the
neomycin analog G418. The hybrid construct was used to
generate transfected cells.

A series of clones were obtained that synthesized mRNA
for human type II procollagen. To analyze the synthesized
proteins, the cells were incubated with [14C]proline and the
14C-labeled medium proteins were analyzed by gel
electrophoresis. See Figure 1. As indicated in Lane 1, the
medium proteins contained the expected type II procollagen
comprised of proα1(II) chains together with proα1(IV) and
proα2(IV) chains of type IV collagen normally synthesized by the
cells. As indicated in Lanes 2 and 3, the type II procollagen
was readily purified by a single step of ion exchange
chromatography. The type II procollagen secreted into the medium
was shown to be correctly folded by a protease-thermal stability
test. See Figure 2.

Example 2 Synthesis of Human Type I Procollagen 

As a second example, HT-1080 cells were co-transfected
with a COL1A1 gene and a COL1A2 gene. Both genes consisted of
a cytomegalic virus promoter linked to a full-length cDNA. The
COL1A2 gene construct but not the COL1A1 gene construct
contained a neomycin-resistance gene. The cells were selected
for expression of the COL1A2-neomycin resistance gene construct
by growth in the presence of the neomycin analog G418. The
medium was then examined for expression of the COL1A1 with a
specific polyclonal antibody for human proα1(I) chains. The
results (see Figure 3) demonstrated that the cells synthesized
human type I procollagen that was probably comprised of the
normal heterotrimeric structure of two proα1(I) chains and one
proα2(I) chain.

Table 1 presents a summary of the DNA constructs
containing human procollagen genes. The constructs were
assembled from discrete fragments of the genes or cDNAs from
the genes together with appropriate promoter fragments.

Example 3 Cell Transfections

For cell transfection experiments, a cosmid plasmid
clone containing the gene construct was cleaved with a
restriction endonuclease to release the construct from the
vector. A plasmid vector comprising a neomycin resistance
gene, Law et al., Molec. Cell Biol. 1983, 3, 2110-2115, was
linearized by cleavage with BamHI. The two samples were mixed
in a ratio of approximately 10:1 gene construct to
neomycin-resistance gene, and the mixture was then used for
co-transfection of HT-1080 cells by calcium phosphate
co-precipitation, Sambrook et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory Press, Second
Edition (1989). DNA in the calcium phosphate solution was
layered onto cultured cells with about 10 µg of chimeric gene
construct per 100 ml plate of preconfluent cells. Cells were
incubated in DMEM containing 10% newborn calf serum for 10
hours. The samples were subjected to glycerol shock by adding
a 15% glycerol solution for 3 minutes. The cells were then
transferred to DMEM medium containing newborn calf serum for
24 hours and then to the same medium containing 450 ng/ml of
G418. Incubation in the medium containing G418 was continued
for about 4 weeks with a change of medium every third day.
G418-resistant cells were either pooled, or separate clones
were obtained by isolating foci with a plastic cylinder and
subculturing them.
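
The quantities in this example can be tabulated with a short
sketch. The Python function below scales the DNA amounts to a
number of plates and counts the medium changes during selection.
It assumes that the 10:1 ratio is a ratio by mass and that "about
4 weeks" means 28 days; neither assumption is stated in the
example, and the function and parameter names are hypothetical.

```python
def cotransfection_plan(construct_ug_per_plate=10.0, ratio=10.0,
                        plates=1, selection_days=28,
                        change_interval_days=3):
    """Return DNA amounts (in micrograms) and medium changes.

    Assumption: the construct-to-resistance-gene ratio is by mass.
    """
    neo_ug_per_plate = construct_ug_per_plate / ratio
    return {
        "construct_ug": construct_ug_per_plate * plates,
        "neo_plasmid_ug": neo_ug_per_plate * plates,
        # One change every third day over the selection period.
        "medium_changes": selection_days // change_interval_days,
    }


plan = cotransfection_plan(plates=4)
print(plan)
```

Under these assumptions, four plates would call for 40 µg of gene
construct and 4 µg of the linearized neomycin-resistance plasmid.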

Example 4 Western blotting 

For assay of expression of the COL2A1 gene,
polyclonal antibodies were prepared in rabbits using a
23-residue synthetic peptide that had an amino acid sequence
found in the COOH-terminal telopeptide of type II collagen.
See Cheah et al., Proc. Natl. Acad. Sci. USA 1985, 82,
2555-2559. The antibody did not react by Western blot analysis
with proα chains of human type I procollagen or collagen,
human type II procollagen or collagen, or murine type I
procollagen. For assay of expression of the COL1A1 genes,
polyclonal antibodies that reacted with the COOH-terminal
polypeptide of the proα1(I) chain were employed. See Olsen
et al., J. Biol. Chem. 1991, 266, 1117-1121.

Culture medium from pooled clones or individual
clones was removed and separately precipitated by the addition
of solid ammonium sulfate to 30% saturation, and precipitates
were collected by centrifugation at 14,000 x g and then
dialyzed against a buffer containing 0.15 M NaCl, 0.5 mM EDTA,
0.5 mM N-ethylmaleimide, 0.1 mM p-aminobenzamidine, and
50 mM Tris-HCl (pH 7.4 at 4°C). Aliquots of the samples were
heated to 100°C for 5 minutes in 1% SDS, 50 mM DTT and 10%
(v/v) glycerol, and separated by electrophoresis on 6%
polyacrylamide gels using a mini-gel apparatus (Holford SE250,
Holford Scientific) run at 125 V for 90 minutes. Separated
proteins were electroblotted from the polyacrylamide gel at
40 V for 90 minutes onto a supported nitrocellulose membrane
(Schleicher and Schuell). The transferred proteins were
reacted for 30 minutes with the polyclonal antibodies at a
1:500 (v/v) dilution. Proteins reacting with the antibodies
were detected with a secondary anti-rabbit IgG antibody
coupled to alkaline phosphatase (Promega Biotech) for 30
minutes. Alkaline phosphatase was visualized with NBT/BCIP
(Promega Biotech) as directed by the manufacturer.

Example 5 Demonstration of Correct Folding of the Secreted Procollagens

To demonstrate that the procollagens synthesized and
secreted in the medium by the transfected cells were correctly
folded, the medium proteins were digested with high
concentrations of proteases under conditions in which only
correctly folded procollagens and collagens resist digestion.
For digestion with a combination of trypsin and chymotrypsin,
the cell layer from a 25 cm² flask was scraped into 0.5 ml of
modified Krebs II medium containing 10 mM EDTA and 0.1%
Nonidet P-40 (Sigma). The cells were vigorously agitated in
a Vortex mixer for 1 minute and immediately cooled to 4°C.
The supernatant was transferred to new tubes. The sample was
preincubated at the temperature indicated for 10 minutes and
the digestion was carried out at the same temperature for 2
minutes. For the digestion, a 0.1 volume of the modified
Krebs II medium containing 1 mg/ml trypsin and 2.5 mg/ml
α-chymotrypsin (Boehringer Mannheim) was added. The digestion
was stopped by adding a 0.1 volume of 5 mg/ml soybean trypsin
inhibitor (Sigma).
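
The final concentrations implied by these additions can be worked
out with a short calculation. The Python sketch below assumes
that each "0.1 volume" is expressed relative to the original
sample volume, which the example does not state explicitly; the
helper name is hypothetical.

```python
def final_concentration(stock_mg_per_ml, added_volume, total_volume):
    """Concentration of a stock after dilution into total_volume."""
    return stock_mg_per_ml * added_volume / total_volume


SAMPLE = 1.0                    # sample volume, arbitrary units
PROTEASE_ADD = 0.1 * SAMPLE     # 0.1 volume of protease stock
INHIBITOR_ADD = 0.1 * SAMPLE    # assumed relative to the sample

# Volume during digestion and after the inhibitor is added.
digest_volume = SAMPLE + PROTEASE_ADD
stopped_volume = digest_volume + INHIBITOR_ADD

trypsin = final_concentration(1.0, PROTEASE_ADD, digest_volume)
chymotrypsin = final_concentration(2.5, PROTEASE_ADD, digest_volume)
inhibitor = final_concentration(5.0, INHIBITOR_ADD, stopped_volume)

print(f"trypsin      {trypsin:.3f} mg/ml")
print(f"chymotrypsin {chymotrypsin:.3f} mg/ml")
print(f"inhibitor    {inhibitor:.3f} mg/ml")
```

Under this assumption the digestion proceeds at roughly 0.09 mg/ml
trypsin and 0.23 mg/ml α-chymotrypsin, and the inhibitor is
present at roughly 0.42 mg/ml once the reaction is stopped.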

For analysis of the digestion products, the sample
was rapidly immersed in boiling water for 2 minutes with the
concomitant addition of a 0.2 volume of 5 x electrophoresis
sample buffer that consisted of 10% SDS, 50% glycerol, and
0.012% bromphenol blue in 0.625 M Tris-HCl buffer (pH 6.8).
Samples were applied to SDS gels with prior reduction by
incubating for 3 minutes in boiling water after the addition
of 2% 2-mercaptoethanol. Electrophoresis was performed using
the discontinuous system of Laemmli, Nature 1970, 227,
680-685, with minor modifications described by de Wet et al.,
Journal of Biological Chemistry 1983, 258, 7721-7728.

Example 6 Specifically Engineered
Procollagens and Collagens

As indicated in Figure 4, a hybrid gene consisting
of some genomic DNA and some cDNA for the proα1(I) chain of
human type I procollagen was the starting material. The DNA
sequence of the hybrid gene was analyzed and the codons for
amino acids that formed the junctions between the repeating
D-periods were modified in ways that did not change the amino
acids encoded but did create unique sites for cleavage of the
hybrid gene by restriction endonucleases.
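
The idea of a codon change that leaves the encoded amino acids unchanged while creating a restriction site can be illustrated with a minimal sketch. It uses the two 12-base sequences given in the Sequence Listing as SEQ ID NO:5 and SEQ ID NO:6, both of which encode Leu Pro Gly Pro. Treating NO:5 as the sequence before modification and NO:6 as the sequence after modification is an assumption made for illustration. The SmaI recognition sequence CCCGGG and the standard genetic code are general knowledge; the function names are hypothetical.

```python
# Standard genetic code, built in TCAG order (one-letter amino acid codes).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def creates_silent_site(before, after, site):
    """True if `after` encodes the same peptide as `before` and gains `site`."""
    return (translate(before) == translate(after)
            and site not in before
            and site in after)

SMAI_SITE = "CCCGGG"         # SmaI recognition sequence
seq_before = "CTCCCTGGTCCT"  # SEQ ID NO:5 (assumed to be the unmodified codons)
seq_after = "CTGCCCGGGCCT"   # SEQ ID NO:6 (assumed to be the modified codons)

print(translate(seq_before), translate(seq_after))
print(creates_silent_site(seq_before, seq_after, SMAI_SITE))
```

Both sequences translate to the same four residues, and only the second contains the SmaI site, which is the property the paragraph above describes.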

A. Recombinant procollagen or collagen
The D3-period of proα1(I) is excised using SrfI and
NaeI restriction nucleases. The bases coding for the amino
acids found in the collagenase recognition site present in the
D3 period are modified so that they code for a different amino
acid sequence. The cassette is amplified and reinserted in the
gene. Expression of the gene in an appropriate host cell will
result in type I collagen which cannot be cleaved by
collagenase.

B. Procollagen or collagen deletion mutants

A D2 period cassette (of the proα1(I) chain) is
excised from the gene described above by digestion with SmaI.
The gene is reassembled to provide a gene having a specific
in-frame deletion of the codons for the D2 period.
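
A minimal sketch of why excision between two engineered SmaI sites can give an in-frame deletion is shown below. The toy sequence is invented for illustration and is not the procollagen gene; it only assumes that the two sites sit at equivalent codon positions. SmaI cuts its CCCGGG site bluntly between CCC and GGG, so religation restores a single site.

```python
SMAI_SITE = "CCCGGG"  # SmaI recognition sequence
SMAI_CUT = 3          # blunt cut between CCC and GGG

def excise_between_sites(seq, site=SMAI_SITE, cut=SMAI_CUT):
    """Remove the fragment between the first two `site` occurrences and
    religate the blunt ends. Returns (religated sequence, removed fragment)."""
    first = seq.find(site)
    second = seq.find(site, first + 1) if first >= 0 else -1
    if first < 0 or second < 0:
        raise ValueError("two recognition sites must flank the cassette")
    left_end = first + cut
    right_start = second + cut
    return seq[:left_end] + seq[right_start:], seq[left_end:right_start]

# Hypothetical toy gene: start codon, junction with a SmaI site, a short
# "cassette", a second junction with a SmaI site, stop codon.
toy_gene = "ATG" + "CTGCCCGGGCCT" + "GGTGCTCCT" + "CTGCCCGGGCCT" + "TAA"

religated, removed = excise_between_sites(toy_gene)
in_frame = len(removed) % 3 == 0  # a multiple of 3 keeps the reading frame

print(religated)
print(len(removed), in_frame)
```

Because both cuts fall at the same position within their codons, the removed fragment is a whole number of codons and the downstream reading frame is preserved.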

C. Procollagen or collagen addition mutants 
Multiple copies of one or more D-cassettes may be 

inserted at the engineered sites to provide multiple copies 
of desired regions of procollagen or collagen. 

Example 7 Expression of Human Prolyl
4-Hydroxylase in a Recombinant DNA System

To obtain expression of the two genes for prolyl 4-hydroxylase
in insect cells, the following procedures were
carried out. The baculovirus transfer vector pVLα58 was
constructed by digesting a pBluescript (Stratagene) vector
containing in the SmaI site the full-length cDNA for the α
subunit of human prolyl 4-hydroxylase, PA-58 (Helaakoski, T.
et al., Proc. Natl. Acad. Sci. USA 1989, 86, 4392-4396), with
PstI and BamHI, the cleavage sites of which closely flank the
SmaI site. The resulting PstI-PstI and PstI-BamHI fragments
containing 61 bp of the 5' untranslated sequence, the whole
coding region, and 551 bp of the 3' untranslated sequence were
cloned into the PstI-BamHI site of the baculovirus transfer
vector pVL1392 (Luckow, V.A. and Summers, M.D., Virology 1989,
170, 31-39). The baculovirus transfer vector pVLα59 was
similarly constructed from pVL1392 and another cDNA clone,
PA-59 (Helaakoski, T. et al., supra), encoding the α subunit of
human prolyl 4-hydroxylase. The cDNA clones PA-58 and PA-59
differ by a stretch of 64 bp.

The pVLβ vector was constructed by ligation of an
EcoRI-BamHI fragment of a full-length cDNA for the β subunit
of human prolyl 4-hydroxylase, S-138 (Pihlajaniemi, T. et al.,
EMBO J. 1987, 6, 643-649) containing 44 bp of the 5'
untranslated sequence, the whole coding region, and 207 bp of
the 3' untranslated sequence to EcoRI/BamHI-digested pVL1392.
Recombinant baculovirus transfer vectors were cotransfected
into Sf9 cells (Summers, M.D. and Smith, G.E., Tex. Agric.
Exp. St. Bull. 1987, 1555, 1-56) with wild-type Autographa
californica nuclear polyhedrosis virus (AcNPV) DNA by calcium
phosphate transfection. The resultant viral pool in the
supernatant of the transfected cells was collected 4 days
later and used for plaque assay. Recombinant occlusion-negative
plaques were subjected to three rounds of plaque
purification to generate recombinant viruses totally free of
contaminating wild-type virus. The screening procedure and
isolation of the recombinant viruses essentially followed
the method of Summers and Smith, supra. The resulting
recombinant viruses from pVLα58, pVLα59, and pVLβ were
designated as the α58 virus, α59 virus, and β virus,
respectively.

Sf9 cells were cultured in TNM-FH medium (Sigma)
supplemented with 10% fetal bovine serum at 27°C either as
monolayers or in suspension in spinner flasks (Techne). To
produce recombinant proteins, Sf9 cells seeded at a density
of 10⁶ cells per ml were infected at a multiplicity of 5-10
with recombinant viruses when the α58, α59, or β virus was
used alone. The α and β viruses were used for infection in
ratios of 1:10-10:1 when producing the prolyl 4-hydroxylase
tetramer. The cells were harvested 72 hours after infection,
homogenized in 0.01 M Tris, pH 7.8/0.1 M NaCl/0.1 M glycine/10
µM dithiothreitol/0.1% Triton X-100, and centrifuged. The
resulting supernatants were analyzed by SDS/10% PAGE or
nondenaturing 7.5% PAGE and assayed for enzyme activities.
The cell pellets were further solubilized in 1% SDS and
analyzed by SDS/10% PAGE. The cell medium at 24-96 hours
postinfection was also analyzed by SDS/10% PAGE to identify
any secretion of the resultant proteins into the medium. The
cells in these experiments were grown in TNM-FH medium without
serum.
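
The infection conditions above can be turned into a volume of virus stock with a short calculation. This is only a sketch: the culture volume and the virus titer used below are hypothetical placeholders, since neither is stated in the text, and the function name is illustrative.

```python
def virus_stock_volume_ml(cells_per_ml, culture_ml, moi, titer_pfu_per_ml):
    """Volume of virus stock (ml) needed to infect a culture at the given
    multiplicity of infection (plaque-forming units per cell)."""
    total_cells = cells_per_ml * culture_ml
    return total_cells * moi / titer_pfu_per_ml

CELL_DENSITY = 1e6   # cells per ml, from the text
CULTURE_ML = 50.0    # hypothetical culture volume
TITER = 1e8          # hypothetical stock titer, pfu per ml

low = virus_stock_volume_ml(CELL_DENSITY, CULTURE_ML, 5, TITER)
high = virus_stock_volume_ml(CELL_DENSITY, CULTURE_ML, 10, TITER)
print(f"MOI 5:  {low:.2f} ml of stock")
print(f"MOI 10: {high:.2f} ml of stock")
```

The same function can be applied separately to the α and β viruses when they are combined in a chosen ratio.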

When the time course of protein expression was
examined, Sf9 cells infected with recombinant viruses were
labeled with [³⁵S]methionine (10 µCi/µl; Amersham; 1 Ci = 37
GBq) for 2 hours at various time points between 24 and 50
hours after infection and collected for analysis by SDS/10%
PAGE. To determine the maximal accumulation of recombinant
protein, cells were harvested at various times from 24 to 96
hours after infection and analyzed by SDS/10% PAGE. Both
the 0.1% Triton X-100- and 1% SDS-soluble fractions of the
cells were analyzed. Prolyl 4-hydroxylase activity was
assayed by a method based on the decarboxylation of
2-oxo[1-¹⁴C]glutarate (Kivirikko, K.I., and Myllyla, R., Methods
Enzymol. 1982, 82, 245-304). The Km values were determined
by varying the concentration of one substrate in the presence
of a fixed concentration of the second, while the concentrations
of the other substrates were held constant (Myllyla, R.,
Tuderman, L., and Kivirikko, K.I., Eur. J. Biochem. 1977, 80,
349-357). Protein disulfide-isomerase activity of the β
subunit was measured by the glutathione:insulin transhydrogenase
assay (Carmichael et al., J. Biol. Chem. 1977, 252, 7163-7167).
Western blot analysis was performed using a monoclonal
antibody, 5B5, to the β subunit of human prolyl 4-hydroxylase
(Hoyhtya, M. et al., Eur. J. Biochem. 1984, 141, 477-482).
Prolyl 4-hydroxylase was purified by a procedure consisting
of poly(L-proline) affinity chromatography, DEAE-cellulose
chromatography, and gel filtration (Kivirikko, K.I., and
Myllyla, R., Methods Enzymol. 1987, 144, 96-114).

Figure 5 presents analysis of the prolyl 4-hydroxylase
synthesized by the insect cells after purification
of the protein by affinity-column chromatography. When
examined by polyacrylamide gel electrophoresis in a
non-denaturing gel, the recombinant enzyme co-migrated with the
tetrameric and active form of the normal enzyme purified from
chick embryos. After the purified recombinant enzyme was
reduced, the α- and β-subunits were detected. Table 2
presents data on the enzymic activity of the recombinant
enzyme. The Km values were determined by varying the
concentration of one substrate in the presence of a fixed
concentration of the second while the concentrations of the
other substrates were held constant.

TABLE 2

                          Km value, µM
Substrate           α58₂β₂    α59₂β₂    Chick enzyme
Fe²⁺                4         4         4
2-oxoglutarate      22        25        22
ascorbate           330       330       300
(Pro-Pro-Gly)₁₀     18        18        15-20

As indicated, the Michaelis-Menten constants (Km) for
the recombinant enzyme were essentially the same as for the authentic
normal enzyme from chick embryos.
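
How a Km value is extracted from rate measurements made at varying concentrations of one substrate can be sketched as follows. The text cites a published procedure and does not state the fitting method, so the double-reciprocal (Lineweaver-Burk) fit used here is a stand-in chosen for simplicity. The data are synthetic, generated without noise from an assumed Km of 22 µM (the 2-oxoglutarate value in Table 2) and an arbitrary Vmax of 1.0; they are not measurements.

```python
def michaelis_menten(s, vmax, km):
    """Initial rate at substrate concentration s."""
    return vmax * s / (km + s)

def fit_lineweaver_burk(substrate, velocity):
    """Least-squares line through (1/s, 1/v): slope = Km/Vmax, intercept = 1/Vmax.
    Returns (km, vmax)."""
    xs = [1.0 / s for s in substrate]
    ys = [1.0 / v for v in velocity]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / intercept
    return slope * vmax, vmax

# Synthetic, noise-free data (assumed parameters, for illustration only).
TRUE_KM, TRUE_VMAX = 22.0, 1.0
concentrations = [5.0, 10.0, 20.0, 40.0, 80.0, 160.0]  # µM
rates = [michaelis_menten(s, TRUE_VMAX, TRUE_KM) for s in concentrations]

km_est, vmax_est = fit_lineweaver_burk(concentrations, rates)
print(f"Km = {km_est:.2f} µM, Vmax = {vmax_est:.2f}")
```

With real, noisy data a direct nonlinear fit of the Michaelis-Menten equation is usually preferred, because the double-reciprocal transform gives undue weight to the lowest concentrations.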

Since the transfected insect cells synthesize large
amounts of active prolyl 4-hydroxylase, they are appropriate
cells to transfect with genes of the present invention coding
for procollagens and collagens so as to obtain synthesis of
large amounts of the procollagens and collagens. Transfection
of the cells with genes of the present invention is performed
as described in Example 3.

Example 8 Expression of Recombinant Collagen Genes in
Saccharomyces cerevisiae Yeast Expressing
Recombinant Genes for Prolyl 4-Hydroxylase

The yeast Saccharomyces cerevisiae can be used with
any of a large number of expression vectors. One of the most
commonly employed expression vectors is the multi-copy 2µ
plasmid that contains sequences for propagation both in yeast
and E. coli, and a yeast promoter and terminator for efficient
transcription of the foreign gene. A typical example of such
vectors based on 2µ plasmids is pWYG4, which has the 2µ ORI-STB
elements, the GAL1 promoter, and the 2µ D gene terminator.
In this vector, an NcoI cloning site containing the ATG
is used to insert the gene for either the α or β subunit of
prolyl 4-hydroxylase. As another example, the expression
vector can be pWYG7L, which has intact 2µ ORI, STB, REP1 and
REP2, the GAL7 promoter, and uses the FLP terminator. In this
vector, the gene for either the α or β subunit of prolyl
4-hydroxylase is inserted in the polylinker with its 5' end at
a BamHI or NcoI site. The vector containing the prolyl
4-hydroxylase gene is transformed into S. cerevisiae either
after removal of the cell wall to produce spheroplasts that
take up DNA on treatment with calcium and polyethylene glycol
or by treatment of intact cells with lithium ions.
Alternatively, DNA can be introduced by electroporation.
Transformants can be selected by using host yeast cells that
are auxotrophic for leucine, tryptophan, uracil or histidine
together with selectable marker genes such as LEU2, TRP1,
URA3, HIS3 or LEU2-D. Expression of the prolyl 4-hydroxylase
genes driven by the galactose promoters can be induced by
growing the culture on a non-repressing, non-inducing sugar
so that very rapid induction follows addition of galactose;
by growing the culture in glucose medium and then removing the
glucose by centrifugation and washing the cells before
resuspension in galactose medium; or by growing the cells in
medium containing both glucose and galactose so that the
glucose is preferentially metabolized before galactose
induction can occur. Further manipulations of the transformed
cells are performed as described above to incorporate genes
for both subunits of prolyl 4-hydroxylase and desired collagen
or procollagen genes into the cells to achieve expression of
collagen and procollagen that is adequately hydroxylated by
prolyl 4-hydroxylase to fold into a stable triple-helical
conformation and is therefore accompanied by the requisite
folding associated with normal biological function.

Example 9 Expression of Recombinant Collagen Genes in
Pichia pastoris Yeast Expressing Recombinant
Genes for Prolyl 4-Hydroxylase

Expression of the genes for prolyl 4-hydroxylase and
procollagens or collagens can also be obtained in non-Saccharomyces
yeast such as Pichia pastoris, which appears to have special
advantages in producing high yields of recombinant protein in
scaled-up procedures. Typical expression in the methylotroph
P. pastoris is obtained with the promoter from the tightly
regulated AOX1 gene, which encodes alcohol oxidase and can
be induced to give high levels of recombinant protein driven
by the promoter after addition of methanol to the cultures.
Since P. pastoris has no native plasmids, the yeast is
employed with expression vectors designed for chromosomal
integration, and genes such as HIS4 are used for selection.
By subsequent manipulations of the same cells, expression of
genes for procollagens and collagens described herein is
achieved under conditions where the recombinant protein is
adequately hydroxylated by prolyl 4-hydroxylase and,
therefore, can fold into a stable helix that is required for
the normal biological function of the proteins in forming
fibrils.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Prockop, Darwin J.
               Ala-Kokko, Leena
               Fertala, Andrzej
               Sieron, Aleksander
               Kivirikko, Kari I.
               Geddis, Amy

(ii) TITLE OF INVENTION: Synthesis of Human Procollagens and
                         Collagens in Recombinant DNA Systems

(iii) NUMBER OF SEQUENCES: 7

(iv) CORRESPONDENCE ADDRESS:
    (A) ADDRESSEE: Woodcock Washburn Kurtz Mackiewicz and Norris
    (B) STREET: One Liberty Place - 46th Floor
    (C) CITY: Philadelphia
    (D) STATE: PA
    (E) COUNTRY: U.S.A.
    (F) ZIP: 19103

(v) COMPUTER READABLE FORM:
    (A) MEDIUM TYPE: Floppy disk
    (B) COMPUTER: IBM PC compatible
    (C) OPERATING SYSTEM: PC-DOS/MS-DOS
    (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
    (A) APPLICATION NUMBER:
    (B) FILING DATE:
    (C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
    (A) NAME: Licata, Jane M.
    (B) REGISTRATION NUMBER: 32,257
    (C) REFERENCE/DOCKET NUMBER: TJU-0733

(ix) TELECOMMUNICATION INFORMATION:
    (A) TELEPHONE: 215-568-3100
    (B) TELEFAX: 215-568-3439



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 4 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

    Arg Tyr His Asp
    1

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 12 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
    (A) NAME/KEY: CDS
    (B) LOCATION: 1..12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

    AGG TAC CAT GAC    12
    Arg Tyr His Asp
    1

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 4 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

    Phe Pro Gly Ala
    1

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 4 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

    Leu Pro Gly Pro
    1

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 12 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
    (A) NAME/KEY: CDS
    (B) LOCATION: 1..12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

    CTC CCT GGT CCT    12
    Leu Pro Gly Pro
    1

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 12 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
    (A) NAME/KEY: CDS
    (B) LOCATION: 1..12

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

    CTG CCC GGG CCT    12
    Leu Pro Gly Pro
    1

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 4 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

    Ala Ala Gly Arg
    1

CLAIMS

What is claimed: 

1. Cells, substantially all of which comprise at
least one transfected human procollagen or collagen gene, and
express procollagen or collagen molecules having at least one
chain derived from said procollagen or collagen gene or genes,
other than the [proα1(I)]₂proα2(I) collagen molecule consisting
of human proα1(I) moieties and non-human proα2(I) moieties, or
non-human proα1(I) moieties and human proα2(I) moieties.

2. The cells of claim 1 having procollagen or
collagen molecules in which the three chains of said
procollagen or collagen molecules are derived from said
transfected gene.

3. The cells of claim 1 wherein one of said human
procollagen genes is the COL1A1 gene encoding the proα1(I)
chain of human type I procollagen.

4. The cells of claim 3 wherein a second of said
human procollagen genes is the COL1A2 gene encoding the
proα2(I) chain of human type I procollagen.

5. The cells of claim 1 wherein one of said human
procollagen genes is the COL2A1 gene encoding the proα1(II)
chain of human type II procollagen.

6. The cells of claim 1 wherein one of said human
procollagen genes is the COL3A1 gene encoding the proα1(III)
chain of the human type III procollagen.

7. The cells of claim 1 wherein at least one of said
genes is a mutant, variant, hybrid or recombinant gene.

8. The cells of claim 1 being mammalian cells.

9. The cells of claim 8 being human tumor cells.

10. The cells of claim 8 wherein said cells are
transfected with a post-translational enzyme.

11. The cells of claim 10 wherein the post-translational
enzyme is prolyl 4-hydroxylase.

12. The cells of claim 1 being insect cells.

13. The cells of claim 12 wherein said cells are
transfected with a post-translational enzyme.

14. The cells of claim 13 wherein said post-translational
enzyme is prolyl 4-hydroxylase.

15. The cells of claim 1 being yeast cells.

16. The cells of claim 15 wherein said cells are
transfected with a post-translational enzyme.

17. The cells of claim 16 wherein said post-translational
enzyme is prolyl 4-hydroxylase.



18. A method for synthesizing procollagen or collagen
in cells comprising:

transfecting at least one procollagen or collagen gene
into cells;

culturing said cells under conditions such that said
transfected procollagen or collagen genes are expressed;

selecting transfected cells that comprise at least one
molecule derived from said procollagen or collagen gene or
genes, other than the [proα1(I)]₂proα2(I) collagen molecule
consisting of human proα1(I) moieties and non-human proα2(I)
moieties, or non-human proα1(I) moieties and human proα2(I)
moieties.



19. The method of claim 18 wherein one of said human
procollagen genes is the COL1A1 gene encoding the proα1(I)
chain of human type I procollagen.

20. The method of claim 19 wherein a second of said
procollagen genes is the COL1A2 gene encoding the proα2(I)
chain of human type I procollagen.

21. The method of claim 18 wherein one of said human
procollagen genes is the COL2A1 gene encoding the proα1(II)
chain of human type II procollagen.

22. The method of claim 18 wherein one of said human
procollagen genes is the COL3A1 gene encoding the proα1(III)
chain of human type III procollagen.

23. The method of claim 18 wherein at least one of
said genes is a mutant, variant, hybrid or recombinant gene.

24. The method of claim 18 wherein said cells are
mammalian cells.

25. The method of claim 24 wherein said cells are
human tumor cells.

26. The method of claim 24 wherein said cells are
transfected with a post-translational enzyme.

27. The method of claim 26 wherein said post-translational
enzyme is prolyl 4-hydroxylase.

28. The method of claim 18 wherein said cells are
insect cells.

29. The method of claim 28 wherein said cells are
transfected with a post-translational enzyme.

30. The cells of claim 29 wherein said post-translational
enzyme is prolyl 4-hydroxylase.

31. The method of claim 18 wherein said cells are
yeast cells.

32. The method of claim 31 wherein said cells are
transfected with a post-translational enzyme.

33. The method of claim 32 wherein said post-translational
enzyme is prolyl 4-hydroxylase.

34. A collagen produced by the cells of claim 1. 



35. A collagen produced by the method of claim 18.

FIG. 1